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Restriction endonucleases act by recognizing and 
binding to particular sequences of nucleotides (the 
'recognition sequence') along the DNA molecule. Once 
bound, the type II endonucleases cleave the molecule 
5 within, or to one side of, the sequence. Different 

restriction endonucleases have affinity for different 
recognition sequences. The majority of restriction 
endonucleases recognize sequences of 4 to 6 nucleotides 
in length, although recently a small number of 

10 restriction endonucleases which recognize 7 or 8 

uniquely specified nucleotides have been isolated. Most 
recognition sequences contain a dyad axis of symmetry 
and in most cases all the nucleotides are uniquely 
specified. However, some restriction endonucleases have 

15 degenerate or relaxed specificities in that they 

recognize multiple bases at one or more positions in 
their recognition sequence, and some restriction 
endonucleases recognize asymmetric sequences. Haelll, 
which recognizes the sequence 5'-GGCC-3', is an example 

20 of a restriction endonuclease having a symmetrical, non- 

degenerate recognition sequence; Haell, which recognizes 
5 ' - (Pu)GCGC (Py) -3 ' typifies restriction endonucleases 
having a degenerate or relaxed recognition sequence; 
while BspMI, which recognizes 5'-ACCTGC-3' typifies 

25 restriction endonucleases having an asymmetric 

recognition sequence. Type II endonucleases with 
symmetrical recognition sequences generally cleave 
symmetrically within or adjacent to the recognition 
site, while those that recognize asymmetric sequences 

30 tend to cleave at a distance of from 1 to 20 nucleotides 

to one side of the recognition site. The enzyme of this 
application, lytoiel, (along with CstMI) has the 
distinction of cleaving the DNA at the farthest distance 
from the recognition sequence of any known type II 

35 restriction endonuclease. More than two hundred unique 



Docket No.: NEB-207-US 



restriction endonucleases have been identified among 
several thousands of bacterial species that have been 
examined to date. 

5 A second coirponent of restriction systems are the 

modification methylases. These enzymes are 
complementary to restriction endonucleases and they 
provide the means by which bacteria are able to protect 
their own DNA and distinguish it from foreign, infecting 

10 DNA. Modification methylases recognize and bind to the 

same nucleotide recognition sequence as the 
corresponding restriction endonuclease, but instead of 
breaking the DNA, they chemically modify one or other of 
the nucleotides within the sequence by the addition of a 

15 methyl group. Following methylation, the recognition 

sequence is no longer cleaved by the restriction 
endonuclease. The DNA of a bacterial cell is modified 
by virtue of the activity of its modification methylase 
and it is therefore insensitive to the presence of the 

20 endogenous restriction endonuclease. It is only 

unmodified, and therefore identifiably foreign, DNA that 
is sensitive to restriction endonuclease recognition and 
cleavage. Modification methyl transferases are usually 
separate enzymes from their cognate endonuclease 

25 partners. In some cases, there is a single polypeptide 

that possesses both a modification methyl transferase 
function and an endonuclease function, for example, 
Eco57I. In such cases, there is a second 
methyltransferase present as part of the restriction- 

30 modification system. In contrast, the Mnel system of the 

present application has no second methyltransferase 
accompanying the endonuclease-methyltransf erase 
polypeptide , 
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Endonucleases are named according to the bacteria 
from which they are derived. Thus, the species 
Haemophilus aegyptius, for example synthesizes 3 
different restriction endonuc leases, named Hael, Haell 
5 and Haelll. These enzymes recognize and cleave the 

sequences 5 ' - (W)GGCC (W) -3 ' , 5 ' - (Pu) GCGC (Py) -3 ' and 5'- 
GGCC-3* respectively. Escherichia coli RY13, on the 
other hand, synthesizes only one enzyme, EcoRl, which 
recognizes the sequence 5 ' -GAATTC-3 ' . 

10 

While not wishing to be bound by theory, it is 
thought that in nature, restriction endonucleases play a 
protective role in the welfare of the bacterial cell. 
They enable bacteria to resist infection by foreign DNA 

15 molecules such as viruses and plasmids that would 

otherwise destroy or parasitize them. They inpart 
resistance by binding to infecting DNA molecules and 
cleaving them in each place that the recognition 
sequence occurs. The disintegration that results 

20 inactivates many of the infecting genes and renders the 

DNA susceptible to further degradation by exonucleases . 

More than 3000 restriction endonucleases have been 
isolated from various bacterial strains. Of these, more 

25 than 240 recognize unique sequences, while the rest 

share common recognition specificities. Restriction 
endonucleases which recognize the same nucleotide 
sequence are termed "isoschizomers. " Although the 
recognition sequences of isoschizomers are the same, 

30 they may vary with respect to site of cleavage (e.g., 

Xmal V. Smal, Endow, et al . , J. Mol . Biol. 112:521 

(1977) ; Waalwijk, et al . , Nucleic Acids Res. 5:3231 

(1978) ) and in cleavage rate at various sites (Xhol v. 
PaeR7I, Gingeras, et al . , Proc. Natl. Acad. Sci. U.S.A. 

35 80:402 (1983) ) . 
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which may also be involved in DNA recognition, a DNA 
modification domain most similar to the gamma-class N6mA 
methyltransf erases, and a carboxy- terminal domain 
presumed to be involved in dimer formation and possibly 
DNA recognition. The enzyme requires SAM for both 
cleavage and modification activity. The single Mmel 
polypeptide is sufficient to modify the plasmid vector 
carrying the gene in vivo to provide protection against 
Mmel cleavage in vitro, yet it is also able to cleave 
unmodified DNAs in vitro when using the endonuclease 
buffer containing Mg++ and SAM. 

There is a continuing need for novel type II 
restriction endonuc leases . Although type II restriction 
endonucleases which recognize a number of specific 
nucleotide sequences are currently available, new 
restriction endonucleases which recognize novel 
sequences provide greater opportunities and ability for 
genetic manipulation. Each new unique endonuclease 
enables scientists to precisely cleave DNA at new 
positions within the DNA molecule, with all the 
opportunities this offers. 

SUMMARY OF THE INVENTION 

25 

In accordance with the present invention, there is 
provided a novel DNA fragment encoding a novel 
restriction endonuclease, obtainable from Methylophilus 
methylotrophus (NEB#1190) . The endonuclease is 
30 hereinafter referred to as "Mmel", which endonuclease: 

(1) recognizes the degenerate nucleotide sequence 
5 ' -TCC (Pu) AC-3 * in a doioble- stranded DNA 
molecule as shown below: 
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5 '-TCC(Pu)AC-3 • 
3'-AGG(Py)TG-5' 

(wherein G represents guanine, C represents 
5 cytosine, A represents adenine, T represents 

thymine, (Pu) represents a purine, either A or 
G, and (Py) represents a pyrimidine, either C 
or T) ; 

(2) cleaves DNA in the phosphodiester bond 
following the 20th nucleotide 3' to the 
recognition sequence 5 ' -TCC (Pu) AC-3 and 
preceding the 18th nucleotide 5 ' to the 
complement strand of the recognition sequence 
5 ■-GT(Py)GGA-3 ■ to produce a 2 base 3^ 
extension: 

5'-TCC(Pu)AC(N20)/-3' 
3 '-AGG(Py)GT(N18) /-5' ; and 
20 

(3) methylates the recognition sequence specified 
in (1) in vivo to protect the host DNA from 
cleavage by the Mmel endonuclease activity; 

25 The invention further relates to additional DNA 

fragments, each of which is identified to encode 
polypeptides which share significant sequence similarity 
to the Mmel restriction-modification polypeptide. The 
DNA fragment encoding the Mmel polypeptide enables the 

30 identification of these additional potential 

endonucleases by using similarity searching of the Mmel 
sequence against sequences available in databases, such 
as GENBANK, using a program such as BLAST (Altschul, et 
al. Nucleic Acids Res. 25:3389-3402 (1997)). These DNA 

35 fragments, as well as any other fragments with such 
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similarity to Mmel that may be deposited in the 
databases in the future, are candidates which may encode 
polypeptides that are similar to Mmel, in that the 
polypeptides encoded act as both restriction 
5 endonuclease and methyltransf erase. These polypeptides 

may, like Mmel, cleave DNA at a similarly far distance 
from the recognition sequence, in the range of 18 to 20 
nucleotides or more, which character is unique and 
useful in certain molecular biology technologies. 

10 Specifically these polypeptides contain amino acid 

motifs common to N6mA DNA methyltransf erases in the 
middle of the polypeptide, have a motif common to 
restriction endonucleases and located in the amino- 
terminal section of the polypeptides, consisting of the 

15 amino acids D/E{X8-X12)D/EXK, and have a region of 

several hundred amino acids following the conserved 
methyltransf erase motifs which are significantly similar 
to this region of Mmel and are believed to serve as a 
dimerization and possibly a DNA sequence recognition 

20 domain. An example of such a polypeptide, CstMI, is 

presented. CstMI has been shown to recognize the 6 base 
pair asymmetric sequence 5'-AAGGAG-3' and to cleave the 
DNA in the same manner as Mmel; 5 ' -AAGGAGN20/N18-3 ' . The 
endonuclease encoded by these DNA fragments may be 

25 produced by the process used for Mmel, as described 

below. 

The present invention further relates to a process 
for the production of the restriction endonuclease Mmel. 

30 This process comprises culturing a transformed host, 

such as E. coli, containing the DNA fragment encoding 
the Mmel restriction system polypeptide, collecting the 
cultured cells, obtaining a cell-free extract therefrom 
and separating and collecting the restriction 

35 endonuclease Mmel from the cell-free extract. The 
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present invention further relates to a process for the 
production of the restriction endonucleases encoded by 
the DNA sequences identified as homologous to Mmel. This 
process comprises culturing a transformed host, such as 
E, coli, containing the gene for these restriction 
systems, collecting the cultured cells, obtaining a 
cell-free extract therefrom and separating and 
collecting the restriction endonuclease from the cell- 
free extract. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 - Agarose gel showing Mmel cleavage of 
lambda, T7, phiXl74, pBR322 and pUC19 DNAs. 

Figure 2 - DNA sequence of the Mmel gene locus (SEQ 
ID N0:1) . 



Figure 3 - Amino acid sequence of the Mmel gene 
20 locus (SEQ ID N0:2) . 

Figure 4 - Agarose gel showing Mmel cleavage of 
pTBMmel.l DNA and unmodified DNA substrates. 

25 Figure 5 - Agarose gel showing Mmel cleavage of 

unmethylated, hemi -methylated and fully methylated DNA 
substrates . 



Figure 6 - Incorporation of labeled methyl groups 
30 into unmethylated, hemi -methylated and fully methylated 

DNA substrates. 



35 



Figure 7 - Multiple sequence alignment of Mmel 
amino acid sequence (SEQ ID NO: 3 through SEQ ID NO: 14) 
and homologous polypeptides from public databases. 
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DETAILED DESCRIPTION OF THE INVENTION 

The recognition sequence and cleavage site of the 
5 endonuclease of the present invention were previously 

described (Boyd, Nucleic Acids Res. 14: 5255-5274 
(1986)). However the Mmel enzyme proved difficult to 
produce from the native host, Methylophilus 
methylotrophus , due to very low yield of the enzyme and 

10 the relative difficulty of growing the M, methylotrophus 

host in large quantity. To overcome these limitations to 
producing Mmel, the present application describes the 
identification of the DNA sequence encoding the Mmel 
gene and the expression of this Mmel gene in a suitable 

15 host, in the present instance E. coli. This manipulation 

of the Mmel encoding DNA fragment results in both a 
significant increase in the amount of enzyme produced 
per gram of cells and a significant increase in ease of 
growth of large amounts of cells containing Mmel enzyme. 

20 

Several standard approaches typically employed by 
persons skilled in the art of cloning were applied to 
the task of cloning of Mmel without success. 
Specifically, the methylase selection approach (Wilson, 
25 et al., U.S. Patent No. 5,200,333) was attempted 

unsuccessfully. Several random libraries of M. 
methylotrophus DNA were constructed in E. coli and 
challenged by digesting with Mmel, but no Itoel methylase 
containing clones were obtained. 

30 

A second approach was also attempted but failed. In 
this approach, antibodies specific for N6mA were used to 
screen a library of random clones constructed in a 
lambda phage replacement vector. The approach was 
35 successful in obtaining methylase positive clones, but 
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all examined were found to express the methyl transferase 
of the second restriction system in M, methylotrophus, 
the Mmell methylase (recognition sequence 5'-GATC-3') 
rather than the desired Mmel methylase activity. 

5 

The successful approach to obtain the desired DNA 
fragment encoding the Mmel restriction system involved 
several steps. First a novel purification procedure was 
developed to purify the Mmel endonuclease peptide to 

10 homogeneity from M, methylotrophus . Once this ultra pure 

Mmel endonuclease polypeptide was successfully obtained 
in a significant amount, amino acid sequence from the 
amino terminus and from internal cyanogen-bromide 
degradation peptides was determined. Using the amino 

15 acid sequence obtained, degenerate DNA primers 

complementary to the DNA coding for the amino acid 
sequences were synthesized and used to PGR amplify a 
portion of the Mmel gene. The DNA sequence of this 
portion of the Mmel gene was determined. The entire Mmel 

20 endonuclease gene and surrounding DNA sequences were 

then obtained by applying the inverse PGR technique. A 
number of primers matching the DNA sequence obtained 
were designed, synthesized and used in combination with 
numerous different templates. The inverse PGR templates 

25 were produced by digesting M, methylotrophus genomic DNA 

with various restriction endonucleases and then ligating 
the cut M, methylotrophus DNA at low concentration to 
obtain circular molecules. The various primers were 
tried in combinations with the various templates to find 

30 primer- template combinations that produced a specific 

PGR amplification product. The products thus obtained 
were sequenced. Once the DNA sequence encoding the 
entire Mmel endonuclease gene was obtained, primers were 
designed to specifically amplify the gene from M. 

35 methylotrophus genomic DNA. The amplified gene was 
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accordance with the present invention. Therefore correct 
internal amino acid sequences determination, which 
enabled the cloning of the Mmel gene, depended on the 
novel purification method described in this application 
5 for the production of sufficiently pure Mmel in large 

enough quantity to determine cyanogen bromide internal 
fragment amino acid sequences, as performed in this 
Application. 

10 In Example II we obtained Mmel by culturing a 

transformed host carrying the Mmel gene, such as E. coli 
ER2683 carrying pTBMmel.l and recovering the 
endonuclease from the cells. A sample of E. coli ER2683 
carrying pTBMmel.l (NEB#1457) has been deposited under 

15 the terms and conditions of the Budapest Treaty with the 

American Type Culture Collection (ATCC) on July 3, 2002 
and bears the Patent Accession No. PTA-4521. 

For recovering the enzyme of the present invention 
20 E. coli carrying pTBMmel.l (NEB#1457) may be grown using 

any suitable technique. For example, E. coli carrying 
pTBMmel . 1 may be grown in Luria broth media containing 
lOO^g/ml ampicillin and incixbated aerobically at 37°C 
with aeration. Cells in the late logarithmic stage of 
25 growth are induced by adding 0.3inM IPTG, grown for an 

additional 4 hours, collected by centrifugation and 
either disrupted immediately or stored frozen at -70°C. 

The Mrtiel enzyme can be isolated from E. coli 
30 carrying pTBMmel.l cells by conventional protein 

purification techniques. For example, cell paste is 
suspended in a buffer solution and treated by 
sonication, high pressure dispersion or enzymatic 
digestion to allow extraction of the endonuclease by the 
35 buffer solution. Intact cells and cellular debris are 
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then removed by centrifugation to produce a cell- free 
extract containing Mmel . The Mmel endonuclease, along 
with its corresponding intrinsic methylase activity, is 
then purified from the cell-free extract by ion-exchange 
5 chromatography, affinity chromatography, molecular sieve 

chromatography, or a combination of these methods to 
produce the endonuclease of the present invention. 

The present invention also relates to methods for 

10 identifying additional DNA fragments, each of which 

encodes a polypeptide having significant amino acid 
sequence similarity to the Mmel polypeptide. The 
polypeptides encoded by these DNA fragments are 
predicted to perform similar functions to Mmel. 

15 Specifically, they are predicted to possess the dual 

enzymatic functions of cleaving DNA in a specific manner 
at a relatively far distance from the specific 
recognition sequence and also modifying their 
recognition sequences to protect the host DNA from 

20 cleavage by their endonuclease activity. Once the amino 

acid sequence of the Mmel endonuclease was determined as 
described in this application, sequences deposited in 
databases can be compared to this Mmel sequence to find 
those few sequences that are highly significantly 

25 similar to Mmel. This method is similar to that of U-S. 

Patent No. 6, 383, 770 (Roberts, et al.), except that here 
we are searching for similarity to the Mmel endonuclease 
sequence, rather than searching for sequences that match 
a database of methyltransf erase or endonuclease proteins 

30 and then examining any unidentified open reading frames 

next to potential methyltransf erase open reading frames. 
Prior to identifying the Mmel amino acid sequence, the 
DNA sequences coding for proteins related to Mmel had 
not been included in the database of restriction and 

35 methyltransf erase gene sequences utilized by Roberts, et 

al-, supra since these sequences had not been linked to 
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any known endonuclease function. The method disclosed 
herein of identifying potential Mmel-like endonucleases 
is thus more specific than the method of U.S. Patent No. 
6,383,770 (Roberts, et al . ) . 

5 

Similarity searching of the Mmel sequence against 
sequences available in databases, such as GENBANK, is 
accomplished using a program such as BLAST (Altschul, et 
al. Nucleic Acids Res. 25:3389-3402 (1997)). A sequence 

10 with an expectation value (E) score of less than E= e"^° 

is considered a potential candidate endonuclease. 
Sequences that give expectation values that are much 
lower, such as less than E=e"^° is to be considered as 
highly likely to be endonucleases like Mnnel. Such 

15 candidate Mmel-like peptides are further examined to see 

if they conform to the domain architecture that Mcnel 
exhibits. A true candidate will contain an endonuclease 
fold motif, usually of the form (D/E) X8-X12 (D/E) XK in 
the amino- terminal portion of the peptide, (Aravind et 

20 al. Nucleic Acid Res. 28:3417-3432 (2000)). A true 

candidate will contain methyl transferase motifs in the 
middle portion of the peptide similar to gamma class N6- 
methyl adenine methyl transferases, and sequences similar 
to the carboxyl portion of Mmel in the carboxyl portion 

25 of the candidate peptide. Such a BLAST search performed 

on June 12, 2003 returned the following sequences as 
highly significantly similar to Mmel: 
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Most of these proteins are labeled as hypothetical 
or putative in their database entries. A niimber of these 
appear to be full-length polypeptides, such as sequence 
#2 above: GcrY. Such candidates can be expressed as 
5 described in Roberts to identify the expected 

endonuclease activity. Some endonuclease genes may be 
inactive in the particular strain used for sequencing 
(Lin, et al • Proc . Natl. Acad. Sci. USA 98:2740-2745 
(2001) ) . In such a circumstance it may prove possible to 

10 express functional endonucleases by repairing the 

mutations that have inactivated these genes. Several of 
the Mmel homologs, such as #7 (SEQ ID NO: 14) (Deinococcus 
radiodurans DR2267) and #8 (SEQ ID NO: 13) (Deinococcus 
radiodurans DR0119.1) have disruptions in the open 

15 reading frames. DR2267 has a stop codon, TAG, which 

prematurely terminates the open reading frame, in a 
position where Mmel has a glutamate amino acid coded for 
by the codon GAG. By changing this TAG stop codon to GAG 
it may be possible to reactivate this potential 

20 endonuclease gene. DR0119.1 is also disrupted, in that 

it has a frameshift that disrupts open reading frame. 
The Mmel sequence may be used as a guide to direct where 
to repair this frameshift by maximizing the similarity 
of the DR0119.1 seciuence to the Mmel sequence. This may 

25 well restore DR0119.1 endonuclease activity. 

An alternative way to generate potential new 
endonucleases is to take advantage of their similar 
domain structure by performing domain swapping. One may 

30 be able to swap the amino terminal domain of an Mmel- 

like peptide, for the amino terminal domain in the Mmel 
protein, for example by swapping the sequence of the 
potential new gene up to the first methyl transferase 
motif (motif X, "Gly Ala His Tyr Thr Ser" into Mmel to 

35 replace this portion of Mmel up to the same sequence. 
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This approach may be particularly useful when only a 
partial sequence is available or a potential gene has 
lost function due to multiple mutations. This approach 
will create a chimeric protein that potentially has 
5 endonuclease activity and cleaves at a distance away 

from the recognition sequence, like Mmel, but that 
recognizes a novel DNA sequence. One may also find 
sequences in the databases that are highly similar to 
Mmel but that are partial. For example, sequence #11 

10 (SEQ ID NO: 9) above (Pseudomonas fluorescens) is from a 

small fragment of DNA sequence in the database- To 
obtain a functional endonuclease like Mmel from this 
sequence one can use inverse PGR or other techniques to 
obtain DNA sequence adjacent to the fragment reported, 

15 then use that sequence to obtain an intact endonuclease 

gene. 

Once a sequence is identified the potential 
endonuclease can be expressed and characterized as 

20 ' described in Roberts, et al. supra. Here, however, there 

is no separate methyltransf erase gene to express along 
with the endonuclease. Once such a potential 
endonuclease is cloned and expressed in a suitable host, 
such as in E. coli, a cell free extract is prepared and 

25 analyzed to detect any endonuclease activity. Such an 

endonuclease assay must include the SAM cofactor 
required by these endonucleases . Once specific DNA 
cleavage activity is found the recognition sequence and 
cleavage site may be determined by standard methods. 

30 (Schildkraut, (1984) In Genet. Eng. (N Y) Vol 6. (Setlow 

J.K., Hollaender, A. Ed.), pp 117-140. Plenum Press, New 
York. "Screening for and characterizing restriction 
endonucleases . " ) 
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The enzymes so identified can be isolated from 
coli cells carrying the DNA fragment in a suitable 
vector by conventional protein purification techniques. 
For example, cell paste is suspended in a buffer 
5 solution and treated by sonication, high pressure 

dispersion or enzymatic digestion to allow extraction of 
the endonuclease by the buffer solution. Intact cells 
and cellular debris are then removed by centrifugation 
to produce a cell-free extract containing the enzyme. 

10 The endonuclease, along with its corresponding intrinsic 

methylase activity, is then purified from the cell-free 
extract by ion-exchange chromatography, affinity 
chromatography, molecular sieve chromatography, or a 
combination of these methods to produce the endonuclease 

15 of the present invention. 

These DNA fragments, as well as any other fragments 
with such similarity to Mmel that may be deposited in 
the databases in the future, are predicted to encode 

20 polypeptides that are similar to Mmel, in that the 

polypeptides encoded act as both restriction 
endonuclease and methyltransf erase . These polypeptides 
may, like Mmel, cleave DNA at a similarly far distance 
from the recognition sequence, in the range of about 18 

25 to 20 nucleotides or more, which character is unicjue and 

useful in certain molecular biology technologies. 

An example of such an enzyme identified by this 
process is CstMI (see U.S. Application Serial 

30 No. , filed concurrently herewith) . CstMI was 

identified as a potential endonuclease because of its 
highly significant amino acid sequence similarity to 
Mmel. CstMI is encoded by sequence #2 above (SEQ ID 
N0:8), which gave highly significant Expectation value 

35 of e'^^^ when compared to Mmel by BLAST. CstMI recognizes 
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the 6 base pair asymmetric sequence 5 ' -AAGGAG-3 ' and 
cleaves the DNA in the same manner as Mmel : it cleaves 
the phosphodiester bond between the 20th and 21st 
residues 3 ' to this recognition sequence on this DNA 
strand, and between the 18th and 19th residues 5' to the 
recognition sequence on the complement strand 5 ' -CTCCTT- 
3' to produce a 2 base 3' extension. 

The present invention is further illustrated by the 
following Examples. These Examples are provided to aid 
in the understanding of the invention and are not 
construed as a limitation thereof . 

The references cited above and below are herein 
incorporated by reference. 

EXAMPLE I 

PURIFICATION OF Iffinel ENXX^NUCLEASE 

A single colony of Methylophilus methylotrophus 
(NEB#1190) was grown for 24 hrs in 1 liter of medium M 
(0.08 MM CUSO4, 0.448 ^M MnS04, 0.348 ZnS04, 6.0 MM 
FeCl3, 18 MM CaC03 , 1 . 6 mM MgSO^, 9 . 0 mM NaH2P04, 10.9 mM 
K2HPO4, 13.6 mM (NHJsSOj for 24 hours. This culture was 
used to inoculate 100 liters of medixom M. The cells 
were grown aerobically at 31 ^C, overnight, until 
stationary. Five 100-liter fermentations were required 
to harvest 752 grams of wet cell pellet. 

750 gram of M. methylotrophus cell pellet was 
suspended in 2.25 liters of Buffer A (20 mM Tris-HCl (pH 
8.0), 50 mM NaCl, 1 . 0 mM DTT, 0 . 1 mM EDTA, 5% Gycerol) 
and passed through a Gaulin homogenizer at -12,000 psig. 
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The lysate was centrifuged at -13,000 x G for 40 minutes 
and the supernatant collected. 

The supernatant solution was applied to a 500 ml 
5 Heparin Hyper-D colintin (BioSepra SA) which had been 

equilibrated in buffer A. A 1.0 L wash of buffer A was 
applied, then a 2 L gradient of NaCl from 0 . 05 M to 1 M 
in buffer A was applied and fractions were collected. 
Fractions were assayed for Ltoie I endonuclease activity 
10 by incixbating with 1 /xg Lambda DNA (NEB) in 50 /xl 

NEBuffer 1, supplemented with 32 mM S-adenosyl-L- 
methionine (SAM) for 15 minutes at 31^ C. Mmel activity 
eluted at 0.3 M to 0.4 M NaCl. 

15 The Heparin Hyper-D column fractions containing the 

Mme I activity were pooled, diluted to 5 0 mM NaCl with 
buffer A (without NaCl) and applied to a 105 ml Sourcel5 
Q column (Amersham Biotech) which had been equilibrated 
with buffer A. A 210 ml wash with buffer A was applied 

20 followed by a 1.0 L gradient of NaCl from 0.05 M to 0.7 

M in buffer A. Fractions were collected and assayed 
from Mme I endonuclease activity. The ly&ne I activity was 
found in the unbound fraction. 

25 The Sourcel5 Q pool was loaded onto a 22 ml AF- 

Heparin-TSK column (TosoHaas) which had been 
equilibrated with buffer A. A wash of 44 ml buffer A 
was applied, followed by a linear gradient of NaCl from 
0.05 M to 1.0 M in buffer A. Fractions were collected 

30 and assayed from Mme I endonuclease activity. The Mme I 

activity eluted between 0.26 M and 0.29 M NaCl. The 
fractions containing activity were pooled and dialyzed 
against buffer B (20 mM NaPO^ (pH 7.0), 50 mM NaCl, 1.0 
mM DTT, 0.1 mM EDTA, 5% Glycerol) . 



35 
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The dialyzed AF-Heparin-TSK pool was loaded onto a 
6 ml Resourcel5 S column {Aitiersham Biotech) which had 
been equilibrated with buffer B. A wash of 12 ml buffer 
B was applied, followed by a linear gradient of NaCl 
5 from 0.05 M to 1.0 M in buffer B. Fractions were 

collected and assayed for Mme I endonuclease activity. 
Mme I activity eluted between 0.14 M and 0.17 M NaCl. 

This pool was applied to a 2 liter Superdex 75 
10 sizing column (Amersham Biotech) which had been 

equilibrated with buffer C (20 mM Tris-HCl, pH 8.0, 500 
inM NaCl, 1 . 0 mM DTT, 0 . 1 mM EDTA, 5% Glycerol) . 
Fractions were collected between 500 and 1500 ml 
elution with buffer C, then assayed by Mme endonuclease 
15 assay and polyacrylamide gel electrophoresis on 4-20% 

gradient gel, followed by protein staining with 
Coomassie Brilliant Blue dye. Fractions eluting between 
77 5 and 825 ml corresponded to Mme I activity and a 
protein band of 105 kDa. These fractions were pooled and 
20 dialyzed against buffer D (20 mM NaP04 (pH 7.0), 50 mM 

NaCl, 1 mM DTT, 5% Glycerol) . 

The dialyzed sizing pool was applied to a 16 ml 
Ceramic HTP col\imn (BioRad) which had been equilibrated 

25 with buffer D. A 32 ml wash with buffer D was followed 

by a linear gradient from 0.02 M to 1.0 M NaP04 in 
buffer D. Fractions were collected and assayed by ly&ne 
endonuclease assay and polyacrylamide gel 
electrophoresis on a 4-20% gradient gel, followed by 

30 protein staining with Coomassie Brilliant Blue dye. Mme 

I eluted between 0.26 M and 0.3 M NaPO^. A portion of 
several fractions containing a single homogeneous 
protein band of 105 kDa were used for protein 
sequencing. The rest of the purified Mmel fractions were 

35 pooled (6 ml @ .36 mg/ml) and dialyzed against storage 
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buffer (10 mM Tris (pH 7.9), 50 itiM KCl, litiM DTT, .1 iriM 
EDTA, 50% glycerol) . The purified Mmel enzyme was stored 
at -20OC. 

Activity determination: 

Samples from 1-4 ^il were added to 50 ^il substrate 
solution consisting of IX NEBuffer 1, 32 |iM S-adenosyl- 
L-methionine, and 1 ^g DNA (lambda, PhiX174 or pUC19 
DNAs) . Reactions were incubated for 15 minutes at 37°, 
received 20 ^ll stop solution and were analyzed by 
electrophoresis on a 1% agarose gel. 

Qptimized endonuclease activity 

Following purification of Mmel from 
methylotrophus, experiments were performed to deteirmine 
the optimal reaction conditions for DNA cleavage. 
Endonuclease activity was found to be significantly 
enhanced by the presence of potassium in the reaction 
buffer. Reactions were performed at 4^0 to 37^0 and from 
5 to 60 minutes with no appreciable change in the amount 
of DNA cleavage. Enzyme concentrations at or near 
stoichiometric equivalence to DNA sites were required 
for maximal cleavage. Large excess of enzyme blocked 
cleavage- These findings were used to reassess the 
activity of Mmel and to define a workable endonuclease 
unit . 

Unit definition 

One unit of Mmel is defined as the amount of Mmel 
required to completely cleave 1 Mg of PhiX17 4 DNA in 15 
minutes at 37°C in NEBuffer 4 (20 mM Tris-acetate, 10 mM 
magnesium acetate, 50 mM potassium acetate, 1 mM 
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dithiothreitol (pH7.9 at 25^0) supplemented with SO/xM 
S-adenosyl-L-methionine (SAM) . 



EXAMPLE II 



CLONING THE Mnel ENDONUCLEASE 



1. DNA purification: Total genomic DNA of 
Methylophilus methylotrophus was prepared. 5 grams of 

10 cell paste was suspended in 20 ml of 25% sucrose, 0.05 M 

Tris-HCl pH 8.0, to which was added 10 ml of 0.25 M 
EDTA, pH 8.0. Then 6 ml of lysozyme solution (10 mg/ml 
lysozyme in 0.25 M Tris-HCl, pH 8.0) was added and the 
cell suspension was incubated at 4^0 for 16 hours. 25 

15 ml of Lytic mix (1% Triton-XlOO, 0.05 M Tris, 62 mM 

EDTA, pH 8.0) and 5 ml of 10% SDS was then added and the 
solution incubated at 37^0 for 5 minutes. The solution 
was extracted with one volume of equilibrated 
phenol: chloroform: isoamyl alcohol (50:48:2, v/v/v) and 

20 the aqueous phase was recovered and extracted with one 

volume of chloroform: isoamyl alcohol (24:1, v/v) two 
times . The aqueous solution was then dialysed against 
four changes of 2 L of 10 mM Tris, 1 mM EDTA, pH 8.0. 
The dialysed DNA solution was digested with RNase (100 

25 Kig/ml) at 37^0 for 1 hour. The DNA was precipitated by 

the addition of 1/lOth volume 5 M NaCl and 0.55 volumes 
of 2-propanol and spooled on a glass rod. The DNA was 
briefly rinsed in 70% ethanol, briefly air dried and 
dissolved in 20 ml TE (10 mM Tris, 1 mM EDTA, pH 8.0) to 

30 a concentration of approximately 500 p,g/ml and stored at 

4°C. 



35 



2. The Mmel endonuclease was purified to 
homogeneity as described in Example I above. 
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3 . Amino acid sequences of the Mnel endonuc lease 
were obtained for the amino terminus and for several 
internal cyanogen bromide digestion products of the Mmel 
polypeptide. The Mmel restriction endonuclease, prepared 
5 as described in Example I above, was subjected to 

electrophoresis and electroblotted according to the 
procedure of Matsudaira (Matsudaira, J. Biol. Chem. 
262:10035-10038, 1987)), with modifications as 
previously described (Looney, et al. Gene 80:193-208 

10 (1989)) . The membrane was stained with Coomassie blue R- 

250 and the protein band of approximately 105 kD was 
excised and subjected to sequential degradation on an 
ABI Precise 494 Protein/ Peptide Sequencer with gas-phase 
delivery (Waite-Rees, et al . J. Bacteriol . 173:5207-5219 

15 (1991)) - The amino acid sequence of the first 14 amino 

terminal residues obtained was the following: 
ALSWNEIRRKAIEF (SEQ ID NO: 15). 

An additional sample of the Mmel endonuclease, 20 

20 /ig in 20 /il/ was treated with 2 of cyanogen bromide 

(Sigma) dissolved in 200 m1 of 88% distilled formic acid 
for 24 hours in the dark at room temperature. This 
reaction mixture was evaporated to dryness and 
resuspended in 20 m1 of loading buffer (1.5M Tris-HCl, 

25 pH 8.5, 12% glycerol, 4% SDS, 0.05% Serva Blue G, 0.05% 

Phenol Red) at lOO^C for 5 minutes. This sample was 
sxibjected to electrophoresis on a Tris-Tricine 10 to 20% 
polyacrylamide gradient gel (Invitrogen) for three hours 
and then transferred to a polyvinylidene difluoride 

30 (PVDF) membrane (Problott, Applied Biosystems Inc.) 

using 10 mM CAPS buffer (lOrriM 3- [cyclohexylamino] -1- 
propanesulfonic acid, 10% methanol, 0.05% SDS, 0.005% 
dithiotheritiol, adjusted to pH 11.0 with NaOH) for 18 
hours at 200 volts in a tank electroblotter (TE52, 

35 Hoeffer) . The membrane was stained with Coomassie blue 
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R-250 and major bands of 25 kilodaltons (kD) , 14 kD, 7.5 
kD and 6 kD were observed, as well as smaller bands. 
These stained protein bands were excised from the 
membrane and each subjected to sequential degradation. 
5 The fragments other than the amino terminal fragment are 

derived from internal cleavage by cyanogen bromide at 
methionine residues from within the protein and thus 
should be preceded by a methionine. The first 29 
residues of the 25 kD peptide corresponded to 

10 (M) KISDEFGNYFARIPLKSTXXIXEXNALQ (SEQ ID NO: 16). Residues 

20, 21, 23 and 25, labeled X, were not identified- The 
first 40 amino acid residues obtained from the 14kD 
fragment were: (M) DAKKRRNLGAHYTSEANILKLI 
KPLLLDELWWFXKVKN (SEQ ID NO: 17). Residue 36 was not 

15 determined. The first 25 residues of the 7 . 5 kD peptide 

corresponded to (M) KSRGKDLDKAYDQALDYFSGIAER (SEQ ID 
NO: 18) . The 6kD fragment was found to contain a mixture 
of three sequences. 

20 4. Amplification of a portion of the Mmel 

endonuclease : The peptide sequence data from the amino 
terminus, 25 kD, 14kD and 7.5kD peptides was used to 
construct a series of degenerate PGR primers 
corresponding to the codons for the amino acid residues. 

25 The order of the internal peptide fragments was unknown, 

so both forward (sense strand) and reverse (antisense 
strand) primers were made for these fragments. The 
primers were: 

30 25 kD fragment: residues DEFGNYFA (SEQ ID NO: 19) 

Foirward : 

1) 5 • -GARTTYGGNAAYTAYTTYGC-3 • (SEQ ID NO: 20) 



35 



Reverse : 

2 ) 5 • -AARTARTTNCCRAAYTCRTC- 3 ' ( SEQ ID NO : 2 1 ) 
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14 kD fragment: residues MDAKKR (SEQ ID NO: 22) 
Forward A: 

3) 5 • -ATGGAYGCNAARAARCG-S • (SEQ ID NO: 23) 
Forward B: 

4 ) 5 ' - ATGGAYGCNAARAARAG- 3 ' (SEQ ID NO : 2 4 ) 
Reverse : 

5 ) 5 » -CGNCGYTTYTTNGCRTCCAT- 3 ' ( SEQ ID NO : 2 5 ) 

7.5 kD fragment: residues DKAYDQA (SEQ ID NO: 26) 
Forward : 

6 ) 5 ' -GAYAARGCNTAYGAYCARGC- 3 ' ( SEQ ID NO : 2 7 ) 
Reverse : 

7 ) 5 ' -GCYTGRTCRTANGCYTTRTC - 3 * ( SEQ ID NO : 2 8 ) 

where 
Y = T,C 
R = A,G 
H = A,T,C 
S = G,C 
N = A,C,G,T 

Primers 1 and 2 are derived from the Mmel 25 IcD 
CNBr peptide and were prepared to prime on the sense 
35 strand (1) or the antisense strand (2) of the gene. 

Primers 3 through 5 are derived from the 14 kD CNBr 
peptide and were prepared to prime on the sense strand 
(3 and 4) or the antisense strand (5) of the gene, with 
3 and 4 differing in the codon usage for the arginine 
40 residue. Primers 6 and 7 are derived from the 7.5 kD 

CNBr peptide and were prepared to prime on the sense 
strand (6) or the antisense strand (7) of the gene. 



10 



15 



20 



25 



45 



PCR amplification reactions were performed using 
the primer combinations of 1 with 5, 1 with 7, 3 with 2, 
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3 with 7, 4 with 2, 4 with 7, 6 with 2 and 6 with 7. A 
portion of the Mmel gene was amplified in a PGR reaction 
by coinbining: 

80 111 lOX Thermopol buffer (NEB) 
50 M-l 4mM dNTP solution (NEB) 

4 ^ll Mmel genomic DNA (SOOfig/ml stock) 
16 ^ll lOOmM MgSO^ 
586 ^1 dH20 

16 |ll (32 units) Vent® exo- DNA polymerase (NEB) . 

This master mix was divided into 8 aliquot s of 90 
111, to which was added 5 |Xl forward primer (10 mM stock) 
and 5 M,l reverse primer (10 m stock) . The cycling 
15 parameters were 95°C for 3 minutes for one cycle, then 

95^C for 30 seconds, 46°C for 30 seconds, 72^0 for 2 
minutes, for 25 cycles - 

The amplification reactions were electrophoresed on 
20 a 1% agarose gel and analyzed. Major DNA amplification 

products of 450 base pairs (bp) (primers 2 with 4), 650 
bp (primers 5 with 6) and 1100 bp (primers 2 with 6) 
were obtained. These fragment sizes are consistent with 
the 7 . 5 kD CnBr fragment being located nearest the amino 
25 end of the protein and approximately 650 bp away from 

the 14kD CnBr fragment, with the 14 kD fragment between 
the 7.5 kD and the 25 kD fragment and adjacent to the 25 
kD fragment. The amplified DNA fragments were gel 
purified and sequenced using the primers that were used 
30 for the amplification. A translation of the DNA sequence 

obtained matched the amino acid sequence derived from 
the purified Mmel endonuclease, indicating that a 
portion of the Mmel endonuclease gene DNA sequence had 
been successfully obtained. 



5 



10 



35 
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5. Determining the DNA sequence for the entire 
Mmel gene and adjacent DNA: The inverse PGR technique 
was used to extend the DNA sequence from both sides of 
the 1060 bp of the Mmel gene obtained above. To 
5 accomplish this a series of primers matching the Mmel 

gene DNA sequence and oriented for inverse PGR were 
designed and synthesized. Mmel genomic DNA was cut with 
a niimber of restriction endonucleases and ligated at low 
concentration to generate circular DNA templates. 

10 

A. Mmel genomic DNA was digested with ten different 
restriction endonucleases and then circularly ligated to 
obtain DNA templates to amplify using the inverse PGR 
technique. The restriction enzymes used were: 

15 

BspHI (T/CATGA) 

EcoRI (G/AATTG) 

Hindlll (A/AGGTT) 

HinPlI (G/GGC) 
20 Mspl (C/CGG) 

Nlalll (CATG/) 

PstI (GTGCA/G) 

Sad (GAGGT/G) 

SphI (GGATG/G) 
25 Xbal (T/GTAGA) 

Restriction enzyme digests were performed by 
combining : 

30 5 |Jil lOX NEBuffer recommended for the enzyme (varied 

with enzyme) 

2 ^1 M. methyloptrophus genomic DNA (1 |i,g) 
43 ^1 dHp 

1 |xl (10-20 units) restriction enzyme. 

35 



The reactions were inctibated for 1 hour at 37°G. 
The restriction endonuclease was inactivated by heating 
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the reaction to 65°C (80^C for PstI) for 20 minutes. The 
digested DNA was then ligated into circular fragments by 
adding 50 M'l lOX T4 DNA ligase buffer, 400 fil (SH^O and 3 
^il concentrated T4 DNA ligase (6000 units. New England 
5 Biolabs, Inc.) and inciibating at 16°C for 16 hours. The 

ligated DNA was then extracted with phenol and 
chloroform, precipitated with 2-propanol and resuspended 
in 100 ^ll TE buffer. 

B. Amplification of DNA adjacent to the 1060 bp 
fragment of the Mmel endonuclease gene: Two pairs of PGR 
primers were designed, one near each end of the 1060 bp 
sequence obtained from direct PGR with degenerate 
primers . The primer sequences were : 

primer IP 1 : 

5 ' -GTTGGATCCGGGAGAGATTGGTGAGG-3 ' ( SEQ ID NO : 2 9 ) 
primer IP 2 : 

5 • -GTTGGATGGTAGGTTAATCTGAATAAGATG- 3 ' ( SEQ ID NO : 3 0 ) 
primer IP 3 : 

5 ' -GTTGGATCCTGTTAATCTGAAACGCTGG- 3 ' { SEQ ID NO : 3 1 ) 
primer IP 4 : 

30 5'-GTTGGATGGTTATAGCAAAATGTGAGGTG-3' (SEQ ID NO: 32) 

Inverse PGR reactions were performed on the 10 
circularized templates produced above with the primer 
pairs of IP 1 with IP 2 , IP 3 with IP 4, and IP 1 with 
35 IP 3 . The amplification reactions were assembled by 

combining : 



10 



15 



20 



25 



40 



80 ^1 lOX Thermopol buffer (NEB) 
50 Jil 4mM dNTP solution (NEB) 
40 [il IP primer (forward) 



Docket No.: NEB-207-US 

-33- 

40 |il IP primer (reverse) 
16 |xl lOOmM MgSO^ 
534 fxl dHjO 

16 |Xl (32 units) Vent® exo- DNA polymerase (NEB) . 

The master mix was aliquoted into ten tubes of 7 6 
^l, to which was added 4 \il of the appropriate digested, 
circularly ligated template. The cycling parameters were 
95^C for 3 minutes for one cycle, then 95^C for 30 
seconds, 56°C for 30 seconds, 72°C for 3 minutes, for 25 
cycles. Amplification products were analyzed by agarose 
gel electrophoresis. 

For primers IP 1 and IP 2 with the SphI template 
15 and the Nlalll template a product of approximately 825 

bp was obtained. For primers IP 3 and IP 4 with the 
BspHI template a product of approximately 800 bp was 
obtained. For primers IP 1 and IP 3 with the EcoRI 
template a product of approximately 1500 bp was 
20 obtained. These amplified DNA fragments were gel 

purified, sequenced and assembled with that previously 
obtained. The assembled sequence did not contain the 
entire Mmel endonuclease open reading frame. The 
assembled sequence was used to direct synthesis of a 
25 second group of inverse PGR primer pairs. The sequences 

of these primers were: 

primer IP 5 : 

30 5 ' -TTCAGAAATACGAGCGATGC- 3 ' ( SEQ ID NO : 3 3 ) 

primer IP 6 : 

35 5 • -GTCAAGCCATAAACACCATC-3 ' ( SEQ ID NO : 3 4 ) 



5 



10 



primer IP 7 : 

40 5 • -GAGGGTCAGAAAGGAAGCTG-3 ' (SEQ ID NO : 35) 
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primer IP 8 : 

5 ' -GTCCAACTAACCCTTTATGG-3 ' (SEQ ID NO : 36 ) 

5 

Inverse PGR amplification reactions were performed 
as above. Using primers IP 5 and IP 6, products were 
obtained from the Nlalll template (approximately 450 bp) 
and the Mspl template (approximately 725 bp) , but not 

10 from the other circular ligation templates. Using 

primers IP 7 and IP 8, products were obtained from the 
EcoRI template (approximately 500 bp) , the SphI template 
(approximately 825 bp) and the BspHI template 
(approximately 7 50 bp) . These DNA fragments were 

15 sequenced and the sequence was assembled with that 

previously obtained. The assembled sequence did not yet 
contain the entire Mmel endonuclease open reading frame, 
so another round of primer synthesis and inverse PGR was 
performed. Additional DNA templates were generated as 

20 above, but using the restriction enzymes Apol (R/AATTY) , 

Asel (AT/TAAT) , BsaHI (GR/GGYG) , Mfel (G/AATTG) , Sspl 
(AAT/ATT) and EcoRV (GAT/ATG) to digest M. 
methylotrophus genomic DNA. The sequences of this third 
round of primers were: 

25 

primer IP 9 : 

5 ' -TTCCTAGTGGTGAACCTTTG-3 ' ( SEQ ID NO : 3 7 ) 
30 primer IP 10: 

5 ' -GTTGGGTTACTTGAAATGAG - 3 ' ( SEQ ID NO : 3 8 ) 
primer IP 11: 

35 

5 ' -CCAAAATGGAAGTTGTTTCG-3 • (SEQ ID NO: 39) 
primer IP 12 : 

40 5 • -GTGAGTGCGCCCTGAATTAG- 3 ' ( SEQ ID NO : 4 0 ) 
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Inverse PGR amplification reactions were performed 
as above. Using primers IP 9 and IP 10, products were 
obtained from the Nlalll template (approximately 425 
bp) , the Mfel template (approximately 750 bp) , the Apol 
template (approximately 800 bp) and the Mspl template 
(approximately 2100 bp) . Using primers IP 11 and IP 12, 
products were obtained from the SphI template 
(approximately 875 bp) , the BspHI template 
(approximately 925 bp) and the EcoRI template 
(approximately 950 bp) . These DNA fragments were 
sequenced and the sequence was assembled with the 
sequences previously obtained. Further sequencing was 
performed on the IP 9, IPIO Mspl 2100 bp product using 
three additional primers: 

primer SI : 

5 • -GCTTCATTTCATCCTCTGTGC-3 ' ( SEQ ID NO : 41 ) 
20 

primer S2 : 

5 • -TAACCGCCAAAATTAATCGTG-3 ' ( SEQ ID NO : 42 ) 
primer S3 : 

25 

5 ' -CCACTATTCATTACAACACC- 3 ' ( SEQ ID NO : 4 3 ) 

The final assembled sequence (Figure 2) contained 
the entire mcael restriction gene, as well as 1640 bp of 
30 sequence preceding the gene and 1610 bp of sequence 

following the gene. 



10 



35 



6- Cloning the mael endonuclease gene in E. 
colli The putative Mmel endonuclease open reading frame 
was identified from the DNA sequence assembly obtained 
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from sequencing the various inverse PGR amplified DNA 
fragments. The beginning of the open reading frame was 
identified on the basis of the match of the predicted 
amino acid sequence at the amino terminus of the open 
5 reading frame with the sequence determined from the Mmel 

endonuclease protein. The predicted end of the open 
reading frame would allow for the coding of an 
approximately 105 kD polypeptide, which matched the 
observed size of the native Mmel endonuclease. The amino 

10 acid sequence deduced from translation of this open 

reading frame contained conserved sequence motifs of 
N6mA DNA methyl transf erases . However, no open reading 
frame containing sequence motifs conserved among DNA 
methyl transf erases was observed adjacent to the Mmel 

15 endonuclease gene, as had been expected. It was decided 

to try to express the Mmel endonuclease in E, coli 
without having a second methyl transferase present to 
protect the E. coli host DNA from cleavage. 
Oligonucleotide primers were synthesized to specifically 

20 amplify the Mmel gene from M. methylotrophus genomic DNA 

for expression in the cloning vector pRRS (Skoglund, 
Gene 88:1-5 (1990)) . The forward primer contained a PstI 
site for cloning, a stop codon in frame with the lacZ 
gene of the vector, a consensus E. coli ribosome binding 

25 site, the ATG start codon for translation (changed from 

the GTG used by M. methylotrophus to facilitate greater 
expression in E. coli) and 20 nucleotides that matched 
the M. methylotrophus DNA sequence: 

30 5 ' -GTTCTGCAGTTAAGGATAACATATGGCTTTAAGCTGGAACGAG- 3 ' 

(SEQ ID NO: 44) 

The reverse primer contained a BamHI site for 
cloning and 22 nucleotides that matched the M. 
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methylotrophus DNA sequence 3 ' to the end of the Mmel 
open reading frame: 

5 ' -GTTGGATCCGTCGACATTAATTAATTTTTGCCCTTAG-3 ' 
5 (SEQ ID NO:45) 

The Mmel gene was amplified in a PGR reaction by 
combining: 

50 ^ll lOX Thermopol buffer (NEB) 
30 |il 4rniy[ dNTP solution 
12.5 \ll forward primer (lO^iM stock) 
12.5 |il reverse primer (10[iM stock) 
5 [il Mmel genomic DNA (500|Xg/ml stock) 
387 |Xl dH^O 

3 |J.l (6 units) Vent® DNA polymerase 

The reaction was mixed and aliquoted into 5 tubes 
of 80 |Lil each. MgSO^ was added (lOOmM stock) to bring 
the final concentration of Mg++ ions to 2mM, 3mM, 4mM, 
5mM and 6mM respectively. The cycling parameters were 
95^C for 30 seconds, 60^0 for 30 seconds, 72°C for 3 
minutes, for 24 cycles. The reactions were analyzed by 
gel electrophoresis and the 3mM through 6mM Mg++ 
reactions were found to contain a DNA band of the 
desired size of 2-8kb. These reactions were pooled and 
the 2.8kb band was gel purified. The 2.8kb amplified 
Mmel gene fragment was digested with BamHI and PstI 
endonucleases (NEB) in the following reaction 
conditions : 

15 ^il lOX BamHI reaction buffer (NEB) 
1.5 |Lll BSA (NEB) 

50 jiil Mmel gene 2 . 8 kb amplified DNA fragment 
35 80 Jil dH20 
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5 |xl BamHI endonuclease (100 units) 
5 jil PstI endonuclease (100 units) 

The reaction was mixed and incubated for 1 hour at 
5 37^0- The small fragments cleaved off the ends of the 

2.8kb DNA fragment were removed, along with the 
endonucleases, by purification on a Qiagen QiaPrep spin 
column according to the manufacturer's instructions. 

10 The cleaved Mmel gene DNA fragment was ligated to 

the pRRS vector as follows: 10 ^1 of the digested, 
purified 2.8kb ly&nel fragment was combined with 5 \ll pRRS 
vector previously cleaved with BamHI and PstI and 
purified, 5 |ll dHjO, 20 \il 2X QuickLigase Buffer (NEB), 

15 the reaction was mixed, and 2 |Xl of QuickLigase was 

added. The reaction was incubated at room temperature 
for 5 minutes. 5 [il of the ligation reaction was 
transformed into 50 |li1 chemical competent E, coll ER2 683 
cells and the cells were plated on L-broth plates 

20 containing 100 ^ig/ml ampicillin and incubated at 37°C 

overnight. Approximately 200 transf ormants were obtained 
and 18 representatives were analyzed as follows: plasmid 
from each colony was isolated by miniprep procedures and 
digested with AlwNI and Ndel endonucleases to determine 

25 if they contained the correct size insert. 2 of the 18 

transformants had the correct size insert of 
approximately 2800 bp. Both clones were tested to see if 
they produced Mmel endonuclease activity. The clones 
were grown overnight at 37^0 in 500 mL L-broth 

30 containing 100 \ig/ml ampicillin. The cells were 

harvested by centrifugation, suspended in 10 mL 
sonication buffer (20mM Tris-HCl, ImM DTT, 0 . ImM EDTA, 
pH7.5) and broken by sonication. The crude lysate was 
cleared by centrifugation and the supernatant was 

35 recovered. The lysate was assayed for endonuclease 
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activity by serial dilution of the lysate in IX reaction 
buffer NEBuffer 1 (New England Biolabs) containing 20 
Jig/ml lambda DNA substrate and supplemented with SAM at 
100 MM final concentration. The reactions were incubated 
5 for 1 hour at 37^0 . The reaction products were analyzed 

by agarose gel electrophoresis on a 1% agarose gel in IX 
THE buffer. One of the two clones had Mmel endonuclease 
activity. This active clone was designated strain 
NEB1457 and was used for subsequent production of Mmel. 
10 The plasmid construct expressing Mmel activity in this 

clone was designated pTBMmel.l. 

EXAMPLE III 

15 THE Mnel ENDONUCLEASE PROVIDES IN VIVO PROTECTION 

AGAINST MMEI CLEAVAGE 

The plasmid pTBMnel . 1 was purified from NEB1457 
using the Qiagen miniprep protocol. This plasmid has two 

20 Mmel sites in the vector backbone, and one site within 

the Mmel gene. The plasmid was digested with Mmel to 
test whether this DNA was resistant to Mmel endonuclease 
activity, which would indicate that the single Mmel gene 
was able to methylate DNA in vivo to protect the host 

25 DNA against its endonuclease activity. To test this the 

following were combined: 

10 (Xl pTBMmel.l miniprep DNA 
15 M-l lOX NEBuffer 4 
30 15 Jil SAM (ImM stock solution) 

110 ^1 dH20 

1 ^il Mmel endonuclease (15 units) 

The reaction was mixed and split in thirds. To one 
35 third was added 0.5 m1 diH^O, to the second was added 0.5 

Ml pRRS vector and to the third was added 0 . 5 Ml PhiXl74 
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DNA as a positive control. The pTBMmel . 1 was not cleaved 
by the Mmel endonuclease activity, while the Phixl74 and 
pRRS DNAs in the same reaction were cleaved, indicating 
that the three Mmel sites in the pTBMmel.l DNA are 
resistant to Mmel endonuclease activity (Figure 4) . 

EXAMPLE IV 

lAtnel ENDONUCLEASE SENSITXVJTY TO METHYLATXON 



10 



The prior literature reports that Mmel endonuclease 
methylates just one strand of its recognition sequence, 
and that this hemi-methylation does not block subsequent 

15 cleavage of the DNA by the endonuclease (Tucholski, Gene 

223 (1998) 293-302). To test this a set of four 
oligonucleotides were synthesized so that a DNA 
substrate could be formed that was either unmethylated 
(oligo 1 + oligo 2), methylated in the top strand only 

20 (oligo 3 + oligo 2), methylated in the bottom strand 

only (oligo 1 + oligo 4) , or methylated on both strands 
(oligo 3 + oligo 4) - The oligos synthesized were: 



Oligo 1: 

25 

5 ' -FAM-GTTTGAAGACTCCGACGCGATGGCCAGCGATCGGCGCCTCAGCTTT 
TG-3 ■ (SEQ ID NO: 46) 



Oligo 2 : 

30 

5 • -FAM-CAAAAGCTGAGGCGCCGATCGCTGGCCATCGCGTCGGAGTCTTCA 
AAC-3' (SEQ ID NO:47) 



Oligo 3 : 

35 

5 • -FAM-GTTTGAAGACTCCG ( 6mA) CGCGATGGCCAGCGATCGGCGCCTCAGCTT 
TTG-3 • (SEQ ID NO: 48) 
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Oligo 4: 

5 • -FAM-CAAAAGCTGAGGCGCCGATCGCTGGCCATCGCGTCGG ( 6mA) GTCTTCA 
5 AAC-3' (SEQ ID NO: 49) 

(Other nucleotides outside the Mmel recognition 
sequence were also methylated for other studies, but 
since Mmel does not have any sequence specifity for 

10 these nucleotides this does affect Mmel activity and 

these other methylations are omitted here for clarity-) 
Duplex DNA was foirmed by mixing lOO/il top strand oligo 
(14mM stock) with lOO/xl bottom strand oligo (14/LiM 
stock) , heating to 85°C and cooling slowly to 3 0°C over 

15 a time of 20 minutes. Mmel was then used to cleave the 

oligo pairs in a 30 m1 reaction of IX NEBuffer4, 2.5 fM 
oligo, 100 MM SAM and 2.5 units Mmel. As a control, 
restriction endonuclease Hpyl88I was also used to cleave 
the oligo DNA. The HpylBBi recognition sequence overlaps 

20 the first 5 nucleotides of the Mmel recognition sequence 

in this DNA, 5 * -TONGA- 3 ' and is blocked by methylation 
at the adenine in either strand of the DNA. Mmel was 
found to cleave uranethylated DNA as expected. In 
contrast to previous teaching (Tucholski, Gene 223:293- 

25 302 (1998)) lyfrnel did not cleave the hemi-methylated DNA 

when the top strand only was methylated: 5 ' -TCCG {N6mA) C- 
3 ' - When the bottom strand only was methylated Mmel did 
cleave the DNA. When both strands were methylated Mmel 
did not cleave the DNA. (Figure 5) This finding is 

30 consistent with both the observed ability of the single 

Mmel enzyme to protect host DNA against cleavage in vivo 
and the observation that Mmel methylates only the top 
strand of its recognition sequence. We confirmed the 
report that Mmel enzyme methylates only the top strand 

35 of its recognition sequence by methylating the oligo 
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pairs above with tritium labeled H^-SAM, washing away 
the unincorporated SAM and counting the radioactivity in 
the DNA. Both the uninethylated oligo DNA and the top 
unmethylated, bottom methylated DNAs had greater than 
5 10- fold more counts than background, while the bottom 

unmethylated, top methylated DNA and the DNA with both 
strands methylated had coiints near background (Figure 
6) . These findings indicate that Mmel is a novel type of 
restriction modification system which does not require a 
10 separate methyl transferase enzyme to modify the host DNA 

to provide protection against the activity of the 
endonuclease, as is the case for the type IIG (also 
called type IV) enzymes such as Eco57I- 

15 EXAMPLE V 

DNA SEQUENCING and ANALYSIS 

DNA Sequencing: DNA sequencing was perfoioned on 
20 double- stranded templates on an ABI 373 or ABI 377 

automated sequencer. Amplified DNA fragments and 
individual clones were sequenced with primers 
synthesized as above or from xmiversal primers located 
in the vector. 

25 

Computer analyses: Computer analyses of the DNA 
sequences obtained were performed with the Genetics 
Computer Group programs (Deverenx, et al . , Nucleic Acids 
Res. 12:387-395 (1984)) and database similarity searches 
30 were performed via the internet at the National Center 

for Biotechnology Information site 

(http://www.ncbi.nlm.nih.gov/BLAST/) using the BLASTX 
and the BLASTP algorithms (Altschul, et al . , J. Mol . 
Biol 215:403-410 (1990) and Gish, et al . , Nature Genet. 
35 3:266-722 (1993)). 



